Detecting Non-modal Phonation in Telephone Speech

نویسندگان

  • Tae-Jin Yoon
  • Jennifer Cole
  • Mark Hasegawa-Johnson
چکیده

Non-modal phonation conveys both linguistic and paralinguistic information, and is distinguished by acoustic source and filter features. Detecting non-modal phonation in speech requires reliable F0 analysis, a problem for telephone-band speech, where F0 analysis frequently fails. We demonstrate an approach to the detection of creaky phonation in telephone speech based on robust F0 and spectral analysis. Our F0 analysis relies on an autocorrelation algorithm applied to the intensity-boosted and inverse-filtered speech signal and succeeds in regions of nonmodal phonation where the non-filtered F0 analysis typically fails. In addition to the extracted F0 values, spectral amplitude is measured at the first two harmonics (H1, H2) and the first three formants (A1, A2, A3). Visual and spectral inspection of the detected creaky phonation confirms the findings reported from laboratory setting. Statistical analysis using oneway ANOVA and classification using Support Vector Machine (SVM) reveals promising results which lead to further improvement for automatic detection of non-modal phonation in telephone speech.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Non-Modal Phonation in Children with Speech Disorders

The occurrence of voice disorder and of non-modal phonation were measured in the speech of 53 children. Thirty seven monolingual English speaking children with developmental speech disorder and compared with that in 16 age-matched children with normal speech development. The children’s ages ranged from 4 to 6 years. The speech disordered children were further assigned to 3 subgroups according t...

متن کامل

Characterisation of voice quality of Parkinson's disease using differential phonological posterior features

Change in voice quality (VQ) is one of the first precursors of Parkinson’s disease (PD). Specifically, impacted phonation and articulation causes the patient to have a breathy, husky-semiwhisper and hoarse voice. A goal of this paper is to characterize a VQ spectrum – the composition of non-modal phonations – of voice in PD. The paper relates non-modal healthy phonations: breathy, creaky, tense...

متن کامل

Acceleration sensor measurements of subglottal sound pressure for modal and breathy phonation quality

We present a non-invasive attempt to indirectly measure the subglottal sound pressure. This quantity opens an additional acoustical path to observe the voiced sound source. The subglottal sound pressure contours of two phonation qualities, the modal phonation quality and the breathy phonation quality, are compared. The electroglottographic signal was recorded simultaneously as a well known refe...

متن کامل

Telephone Based Voice Pathology Assessment using Automated Speech Analysis and VoiceXML

A system of remotely detecting vocal fold pathologies using telephone quality speech is presented. Using VoiceXML, a database of 631 clean speech files of the sustained phonation of the vowel sound /a/ (58 normal subjects, 573 pathologic) from the Disordered Voice Database Model 4337 was transmitted over telephone channels to produce a test corpus. Pitch perturbation features, amplitude perturb...

متن کامل

Voice Pathology Assessment based on a Dialogue System and Speech Analysis

A system of remotely detecting vocal fold pathologies using telephone quality speech recorded during a telephone dialogue is presented. This study aims at developing a dialogue system using VoiceXML for remote diagnosis of voice pathology. To assess the accuracy of the system, a database of 631 clean speech files of the sustained phonation of the vowel sound /a/ (58 normal subjects, 573 patholo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005